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Abstract 

We address the estimation of conditional quantiles wiien the covariate is functional and 
when the order of the quantiles converges to one as the sample size increases. In a first time, 
we investigate to what extent these large conditional quantiles can still be estimated through 
a functional kernel estimator of the conditional survival function. Sufficient conditions on the 
.. _ rate of convergence of their order to one are provided to obtain asymptotically Gaussian dis- 

^ ' tributed estimators. In a second time, basing on these result, a functional Weissman estimator 



is derived, permitting to estimate large conditional quantiles of arbitrary large order. These 
O^ ■ results are illustrated on finite sample situations. 

(n: 
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^ . 1 Introduction 



Let (Xi, li), i = 1, . . . , n be independent copies of a random pair (X, F) in _B x R where E is an 
infinite dimensional space associated to a semi-metric d. We address the problem of estimating 
q(an\x) £ R verifying P(F > q{oLn\x)\X = x) = an where a„ — > as n ^ cx) and x € E. In such 
a case, q{an\x) is referred to as a large conditional quantile in contrast to classical conditional 
quantiles (or regression quantiles) for which a„ = a is fixed in (0, 1). While the nonparametric 
estimation of ordinary regression quantiles has been extensively studied (see for instance [351 
l39] or [18], Chapter 5), less attention has been paid to large conditional quantiles despite their 
potential interest. In climatology, large conditional quantiles may explain how climate change over 
years might affect extreme temperatures. In the financial econometrics literature, they illustrate 
the link between extreme hedge fund returns and some measures of risk. Parametric models 



arc introduced in [10[ 138] and senii-paramctric methods are considered in [31 I31| . Fully non- 
parametric estimators have been first introduced in [21 H] through local polynomial and spline 
models. In both eases, the authors focus on univariate covariates and on the finite sample properties 
of the estimators. Nonparametric methods based on moving windows and nearest neighbors are 
introduced respectively in [23l [25] and [24] in the fixed design setting. We also refer to [IS] . 
Theorem 3.5.2, for the approximation of the nearest neighbors distribution using the Hellinger 
distance and to [12] for the study of their asymptotic distribution. 

An important literature is devoted to the particular case where the conditional distribution of 
Y given X ^ x has a finite endpoint (p{x) and when X is a finite dimensional random variable. The 
function (p is referred to as the frontier and can be estimated from an estimator of the conditional 
quantilc g(a„|a;) with a„ — >■ 0. As an example, a kernel estimator of ip is proposed in [27], the 
asymptotic normality being proved only when Y given X = a; is uniformly distributed on [0, ^(x)]. 
We refer to [33] for a review on this topic. 

Estimation of unconditional large quantiles is also widely studied since the introduction of 
Weissman estimator [41] dedicated to heavy-tailed distributions, WeibuU-tail estimators [12l [22] 
dedicated to light-tailed distributions and Dekkers and de Haan estimator [11] adapted to the 
general case. 

In this paper, we focus on the setting where the conditional distribution of Y given X = x has 
an infinite endpoint and is heavy-tailed, an analytical characterization of this property being given 
in the next section. In such a case, the frontier function does not exist and q{an\x) ^^ oo as q;„ ^^ 0. 
Nevertheless, we show, under some conditions, that large regression quantiles q{an\x) can still be 
estimated through a functional kernel estimator of P(y > .\x). We provide sufficient conditions on 
the rate of convergence of a„ to so that our estimator is asymptotically Gaussian distributed. 
Making use of this, some functional estimators of the conditional tail-index are introduced and a 
functional Weissman estimator |41j is derived, permitting to estimate large conditional quantiles 
q{l3n\x) where /3„ — > arbitrarily fast. 

Assumptions are introduced and discussed in Section [2] Main results are provided in Section [3] 
and illustrated on simulated data in Section |4l Proofs are postponed to the appendix. 

2 Notations and assumptions 

The conditional survival function (csf) of Y given X = x \s denoted by F{y\x) = P(F > y\X = x). 
The functional estimator of F{y\x) is defined for all {x,y) & E xM. by 

n / ^^ 

Fn{y\x) = ^i^(d(x-,XO//i)Q((K, -2;)/A) / ^if(d(x,X,)//i), (1) 

i=l / i=l 

with Q{t) = /_^ Q'{s)ds where K : IR+ -^ IR+ and Q' : M -^ IR+ are two kernel functions, and 
h ~ hn and A = A„ arc two nonrandom sequences (called window-width) such that /i — >■ as 



n -^ oo. Let us emphasize that the condition A — > is not required in this context. This estimator 
was considered for instance in |18) . page 56. Its rate of uniform strong consistency is estabhshed 
by |16] . In Theorem [T] hereafter, the asymptotic distribution of ([l} is estabhshed when estimating 
small tail probabilities, i.e when y = yn goes to infinity with the sample size n. Similarly, the 
functional estimators of conditional quantiles q{a\x) are defined via the generalized inverse of 
Fn{.\x): 

qnia\x) = F^Tia\x) = inf{t, F„{t\x) < a}, (2) 

for all a G (0,1). Many authors are interested in this estimator for fixed a G (0,1). Weak 
and strong consistency are proved respectively in |39j and j20| . Asymptotic normality is shown 
in [3J [3S1 SO] when E is finite dimensional and by [T7| for a general metric space under dependence 
assumptions. In Theorem [2l the asymptotic distribution of ([2]) is investigated when estimating 
large quantiles, i.e when a = a„ goes to as the sample size n goes to infinity. The asymptotic 
behavior of such estimators depends on the nature of the conditional distribution tail. In this 
paper, we focus on heavy tails. More specifically, we assume that the csf satisfies 



(A.l): F{y\x)^c{x)cxpS^- 1^ (^--^-e{u\x)^^y 



where 7 is a positive function of the covariate x, c is a positive function and |e(. |a;) | is continuous and 
ultimately decreasing to 0. Examples of such distributions are provided in Tablc[TJ (A.l) implies 
that the conditional distribution of Y given X = a; is in the Frcchet maximum domain of attraction. 
In this context. 'y{x) is referred to as the conditional tail-index since it tunes the tail heaviness of 
the conditional distribution of Y given X ^ x. More details on extreme-value theory can be found 
for instance in J14j . Assumption (A.l) also yields that F{.\x) is regularly varying at infinity with 
index —1/7(2;). i.e for all C > 0, 

limBM£)=C-^/^(-). (3) 

y^oo F[y\x) 

We refer to U for a general account on regular variation theory. The auxiliary function £(.|a;) plays 
an important role in extreme- value theory since it drives the speed of convergence in ([3]) and more 
generally the bias of extreme-value estimators. Therefore, it may be of interest to specify how it 
converges to 0. In [l][28], |e(.|a;)| is supposed to be regularly varying and the estimation of the 
corresponding regular variation index is addressed. 
Some Lipschitz conditions are also required: 

(A. 2): There exist k^, Kc, k^ > and mq > 1 such that for all {x,x') E E x E and u > uq, 

logc(a;) — logc(a;')| < Kcd{x,x'), 
\e{u\x) ~ e{u\x )\ < K.^d{x^x), 



1 1 



l{x) -i{x') 



< K^d{x,x'). 



The last two assumptions arc standard in the functional kernel estimation framework. 

(A. 3): K is a function with support [0,1] and there exist < Ci < C2 < cxd such that 
Ci <K(t) < C2 for all te [0,1]. 

(A. 4): Q' is a probability density function (pdf) with support [—1,1]. 
One may also assume without loss of generality that K integrates to one. In this case, K is called 
a type I kernel, see [18], Definition 4.1. Letting B{x,h) be the ball of center x and radius h, we 
finally introduce ipx{h) ■= V{X € B{x,h) the small ball probability oi X. Under (A. 3), the r-th 
moment fj,x (h) := ¥,{K'^{d{x,X)/h) can be controlled for all t > by Lemma [3] in Appendix. It 
is shown that fij {h) is of the same asymptotic order as ipx{h). 

3 Main results 

The first step towards the estimation of large conditional quantiles is the estimation of small tail 
probabilities F{yn\x) when ?/„ — > 00 as n ^- 00. 

3.1 Estimation of small tail probabilities 

Defining 






-1/2 



the following result provides sufficient conditions for the asymptotic normality of Fn{yn\x). 

Theorem 1 Suppose (A.l) - (A. 4) hold. Let x £ E such that ipx{h) > and introduce y„.j = 
ajy„(l + 0(1)) for j ~ 1, . . . , J with < oi < 02 < • ■ • < aj and where J is a positive integer. 
If Un ^^ 00 such that nipx{h)F{yn\x) — >■ 00 and nipx{h)F{yn\x){X/yn V hlogyn)^ — > as n -^ 00, 
then 

IM^M ")],-.. 

is asymptotically Gaussian, centered, with covariance matrix C{x) where Cjj' (x) = aJ/Zr for 

(j,/)e{l,...,jr. 

Note that nipx{h) F (y^lx) — > cx) is a necessary and sufficient condition for the almost sure presence 
of at least one sample point in the region B{x, h) x (?/„, 00) of i*^ x R, see Lemma 0] in Appendix. 
Thus, this natural condition states that one cannot estimate small tail probabilities out of the 
sample using Fn- Besides, from Lemma[3l A~^(x) is of the same asymptotic order as nLpx{h)F(yn\x) 
and consequently A„(a;) — > as n — !► c«. Theorem [T] thus entails Fn{yn,j\x)/ F{yn,j\x) — ;• 1 
which can be read as a consistency of the estimator. The second condition nipx{h) F {yn\x){X/ yn V 
hlogyn)^ — > imposes to the biases A/y„ and hlogyn introduced by the two smoothings to 
be negligible compared to the standard deviation A„(a;) of the estimator. Theorem [T] may be 



compared to |13j which establishes the asymptotic behavior of the empirical survival function in 
the unconditional case but without assumption on the distribution. 

3.2 Estimation of large conditional quantiles within the sample 

In this paragraph, we focus on the estimation of large conditional c^uantiles of order a„ such that 
rupx{h)an —^ooasn^ cxd. This is a necessary and sufficient condition for the almost sure 
presence of at least one sample point in the region B{x, h) x (g(a„|a;), cxo) of _E x R, see Lemma|4] 
in Appendix. In other words, the large conditional quantile g(a„|a;) is located within the sample. 
This situation is denoted as (S.l) by [25| in the fixed design setting. Letting 

a„(x) = nan (,. 

Lemma [3] shows that (Jn{x) is of the same asymptotic order as {nipx{h)an)~^''^ and thus the 
condition rufx{h)an — >■ oo is equivalent to cr„(x) — >■ as n — > cxd. 

Theorem 2 Suppose (A.l) - (A. 4) hold. Let x ^ E such that ipxih) > and consider a sequence 
Ti > T2 > • • ■ > Tj > where J is a positive integer. If an —5- such that (t„ (x) — > and 
a~^{x){X/q{an\x) V /iloga„) — > as n -^ oo, then 

'\qir,an\x) J } j=i,...,j 

is asymptotically Gaussian, centered, with covariance matrix 7^(x)E where Sjj' = ^/tj^^j' for 

Remark that (A.l) provides an asymptotic expansion of the density function of Y given X = x: 

f{y\x) - —-^^^(1 - e{y\x)) = ^^' ^ (1 + o(l)) 

l[x) y i{x) y 

as y ^^ CXD. Consequently, Theorem [2] entails that the random vector 

{ , i2), XM/J ^^J""*^^ ~ ^J"")) ^''^I{q{Tjan\x)\x) {qn{Tjan\x) - q{Tjan\x)) \ 
{{lJ>x'{h)Y/^ Jj=i...,j 

is also asymptotically Gaussian and centered. This result coincides with [3], Theorem 6.4 estab- 
lished in the case where a„ = a is fixed in (0,1) and in a finite dimensional setting. 

3.3 Estimation of arbitrary large conditional quantiles 

This paragraph is dedicated to the estimation of large conditional quantiles of arbitrary small 
order /3„. For instance, if nipx{h)f3n — > c G [1,cxd) then q(/3n\x) is located near the boundary of 
the sample. If n(^^(/i)/3„ — > c G [0, 1) then q{/3n\x) is located outside the sample. These situations 



are denoted respectively as (S.2) and (S.3) by [35] in the fixed design setting. Here, a functional 
Wcissman estimator [JT] is proposed to tackle all possible situations: 

C(/3«|x) = g„(a„|x)(a„//3„)>'(-). (4) 

Here, g„(a„|a;) is the functional estimator ([2]) of a large conditional quantile q{an\x) within the 
sample and ^n{x) is an estimator of the conditional tail-index 7(0;). As illustrated in the next 
theorem, the extrapolation factor {an//3n)^"^^' allows to estimate arbitrary large quantiles. 

Theorem 3 Suppose (A.l) - (A. 4) hold. Let x ^ E and introduce 

• an ^ such that cr„(a;) — > and (T^"^(a;)(A/q(Q:„|a;) Vft.loga„ Ve((7(a„|a;)|x)) —> as n —> 00 , 

• iPn) such that Pn/otn — >■ as n ^ oo, 

• jn(x) such that a^^{x){^n{x) — j{x)) — > N'{0,V{x)) where V{x) > 0. 
Then, 

l0g(an/Pn) V liPn\x) ) 

Note that, when K is the pdf of the uniform distribution, this result is consistent with [?5] . 
Theorem 3, obtained in a fixed-design setting. 

Let us now focus on the estimation of the conditional tail-index. Let a„ — !■ and consider a 
sequence 1 = ti > T2 > • ■ • > tj > where J is a positive integer. Two additional notations are 
introduced for the sake of simplicity: w = (1, . . . , 1)* G K'^ and v = (log(l/Ti), . . . ,log(l/Tj))* e 
R"^. The following family of estimators is proposed 

-0, N ^ 0(loggn(TiQ„|a;), . . . ,logg„(TjQ:„|a;)) 

^"^''^ 0(log(l/Ti),...,log(l/rj)) ' ^^ 

where : M"^ — !> R denotes a twice differentiable function verifying the shift and location invariance 
conditions 

4>{r]u + x) = 4>{x) 

for all > 0, ?7 G M and x E R-' . In the case where J = 3, ti = 1, r2 = 1/2 and T3 = 1/4, the 

function 

, r ^ J. ) =\q- / cxp(4x2) -exp(4a;i) 
Vexp(4x3) - exp(4a;2) 

leads us to a functional version of Pickands estimator |34] : 



log2 \qn{2an\x) - qn{4:a,i\x) J 



We refer to |26| for a different variant of Pickands estimator in the context where the distribution 
of Y given X = x has a finite endpoint. Besides, introducing the function mp{xi, . . . ,xj) = 



12'i=ii^j ~ 2;i)^ for all p > and considering (j>p{x) = nip (x) gives rise to a functional version of 
the estimator considered for instance in |37| . example (a): 

/J ^"^ 

t-i^) = I E [logg„(r,a„|.T) -logq^.la^lx)]" / E[log(l/r,)]P 

As a particular case (t>i{x) = mi{x) corresponds to a functional version of the Hill estimator |32j : 

,/ /J 

iti^) = XI [loggn(7-ja„|x) - logq„(a„|x)] / Elog(l/Tj) . 
i=i / i=i 

More interestingly, if {0*-^^ . . . , 0'^-'} is a set of H functions satisfying © and if A : R^ — ;> R is a 
homogeneous function of degree 1, then the aggregated function A{(f>^-^\ . . . , (f>^-^^) also satisfies (|6]). 
Generalizations of the functional Hill estimator can then be obtained using H = 2, Ap{x,y) = 



xPy^ P and defining 4>p^q^r — Ap{'Pq^ 4'r) = 'm: 



p/g^Ai-p)/r. 



m 



.p/g /^^ T , , , ,, \ (p-i)/*" 



AiVp.q.r ^r^\ _ _V / V 

(Ej=i [logg„(Tja„|a;) - logq„(a„|x)]''j (Ej=i[log(l/T?)]''j 

For instance, the estimator introduced by |29) . equation (2.2) corresponds to the particular function 
0p,p,i and the estimator of [5] corresponds to (j)p,pe,p-i- 

For an arbitrary function 0, the asymptotic normality of 7,^(x) is a consequence of Theorem[2l The 
following result permits to establish the asymptotic normality of the above mentioned estimators 
in an unified way. 

Theorem 4 Under the assumptions of Theorem [^ and if, moreover, a~^{x)e{q{an\x)\x) — > 
as n —>■ cx), then, a^^{x){'j^{x) — j{x)) converges to a centered Gaussian random variable with 
variance 

V^ix) = 2!M(V^(7(2;)«))*I](V0(7(^)«)). 

Let us note that the additional condition a~^{x)e{q{an\x)\x) — ^ is standard in the extreme- 
value framework: Neglecting the unknown function e(.|a:;) in the construction of 7,^(2;) yields a bias 
that should be negligible with respect to the standard deviation cr„(a;) of the estimator. Finally, 
combining Theorem [3] and Theorem 21 the asymptotic distribution of the functional large quantile 
estimator (j™''^(^„|a;) based on (|4]) and (O is readily obtained. 

Corollary 1 Suppose (A.l) - (A. 4) hold. Let x ^ E such that ipx{h) > and consider a sequence 
1 = Ti > T2 > • ■ • > Tj > where J is a positive integer. If 

• a„ — 7- 0, CTn(x) -^ and a^^ {x){\ j q(an\x) V hlogan V e{q{an\x)\x)) — )■ as n ^ 00, 

• /3„/a„ — >■ as n -^ 00, 



then 



~i Tir\ — (a I \ 1 — > A/(O,V0(a;)). 



log 
As an example, in the case of the functional Hill and Pickands estimators, we obtain 

. _ .2 



^0FP (X) 



72(x)(22''(^)+i + 1) 




4(log2)2(2'/(^)-l)2' 

Clearly, V^^^[x) is the variance of the classical Pickands estimator, see for instance [3^, Theo- 
rem 3.3.5. 



4 Illustration on simulated data 

The finite sample performance is illustrated on iV = 50 replications of a sample of size n = 500 
from a random pair (X, y), where the functional covariate X e i? = L^p, 1] is defined by X(t) = 
cos(27rZ<) for all t £ [0, 1] where Z is uniformly distributed on [1/4, 1]. Some examples of simulated 
random functions X arc depicted on Figure [1] Besides, the conditional distribution of Y given X 
is a Burr distribution (see Table 1) with parameters t(X) = 2 and X{X) = 2/(8||X||2 — 3) with 
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We focus on the estimation of (j(/3„|a;) with /3„ = 5/n. To this end, the functional Weissman 
estimator q^(/3„|a;) is used with a piecewise linear kernel K{t) = (1.9 — 1.8t)I{t G [0, 1]} and the 
triangular kernel Q' . The conditional tail index is estimated by the functional Hill estimator 7^^. 
The choice of the semi- metric d is a recurrent issue in functional estimation (see [18) . Chapter 3). 
Here, two semi-metrics arc considered. The first one is defined for all {s,t) G E'^ by dx{s,t) ~ 
\\s — t\\2 and coincides with the L2 distance between functions. Remarking that the conditional 
quantile q{an\X) depends only on ||X||2, or equivalently on Z, another interesting seini-metric is 
dzis,t)^\\\s\\l-\m\. 

With such choices, the functional Weissman estimator (j^(/3„|a;) depends on three parameters 
h, A and a„ and on the tj's used to compute 7^^. 

- The smoothing parameter h is selected using the cross-validation approach introduced in j42] and 
implemented for instance in [SJ [3T] : 

{n n 
i=l ] = 1 

where Fn,^i is the estimator (depending on h) given in ([1]) computed from the sample {{Xi, 1^), 1 < 
£ < n, i^i]. Here, U \s & regular grid, H = {hi <h2 < ■ ■ ■ < hm} with hi = 1/100, /im = 1/10 



and M = 20. 

- In our experiments, the choice of the bandwidth A appeared to be less crucial than the other 
smoothing parameter h. It could have been selected with the same criteria as previously, but for 
simplicity reasons, it has been fixed to A = 0.1. 

- The choice of a„ is equivalent to the choice of the number of upper order statistics in the non- 
conditional extreme-value theory. It is still an open question, even though some techniciucs have 
been proposed, see for instance [7] for a bootstrap based method. 

- The selection of the r^'s is equivalent to the selection of an estimator for the conditional tail 
index. Once again, extreme- value theory does not provide optimal solution to this problem. 

In order to assess the impact of the choice of «„ and tj's, we compute the median of the 
L2-errors A^ , r — 1, . . . ,N with 



A^J^ =J2(qj:il3n\X.Y'-^ -q{l3n\X.) 



Here, q^(/3„|Xi )'''■' is the estimation computed on the rth replication and d can be either dx or 
dz- Different values of a„ and r, are investigated: a„ — clog(n)/n with c G {5,10,15,20} and 
Tj ~ (1/j)* with s G {1,2,3, 10}. The results are collected in Table[2] Clearly, the errors obtained 
with the semi- metric dz are always smaller than these obtained with dx ■ The optimal value of c 
depends on the semi-metric whereas the large values of s seem to be the best ones for both metrics. 
Nevertheless, it will appear in the following that the estimations are not (at least visually) very 
sensitive with respect to the choice of q;„ and tj. In Figure [U the estimator q™{f3n\x) is represented 
as a function of Z. The estimator has been computed for two sets of (a„, tj): {an ~ 151og(7i)/n, 
Tj = (1/i)'^) and (a„ = 101og(n)/n, tj ~ (1/j)^) and for the two semi-metrics dx and dz- We 
limited ourselves to the representation of the estimator computed on the replication giving rise 

X (r) 

to the median of the L2-crrors A^ , r = 1, . . . , N. It appears that there is no visual significative 
difference between the two choices of (a„, tj). The choice of the semi- metric seems to be a more 
challenging issue since the results obtained with dz are visually much better than those obtained 
with dx- 



5 Appendix: Proofs 

5.1 Preliminary results 

The following two lemmas are of analytical nature. The first one is dedicated to the control of the 
local variations of the csf when the quantity of interest y goes to infinity. 



Lemma 1 Let x £ E and suppose (A.l) and (A. 2) hold. 

(i) If Hn -^ oo and hlogyn — > as n -^ cxd, then, for n large enough, 



sup 

x'£B{x,h) 



-1 



< 2{Kc + Ky + Ke)h\ogyn. 



F{yn\x') 

(a) If yn — >■ oo and y^ — > oo as n ^>- oo, then, for n large enough. 



sup 

x'eB{xji) 



Hv'nW) 



- 1 



< 



F{yn\x') 
Proof, (i) Assumption (A.l) yields, for all x' 6 B{x, h) 



2/t{x) 



log 



, ( F{yn\x) 



F{yn\x') 



< |logc(x) - logc(a;')| + 



/■^" du 
< Kch + I {k^ + K^)h 



1 



- 1 



1 



l{x) 7(a;') 



+ \£{u\x) - e{u\x')\ ] — 
' u 



< {kc + K-y + Ke)h log y„ 
eventually, from (A. 2). Thus, 



sup 

d{x,x')<h 



log 



f Fjynl 



O{hlogyn)^0 



\F{yn\x' 
as n -> oo and taking account of log(w + l)'^uasii^>0 gives the result. 



(ii) Let us assume for instance y'„ > ?/„. From (A.l) we have 



Fiy'Jx') 



FiynW) 



, ^ -1/7(2;') 
1- ( ^ 

Vn 



exp 






(7) 



Now, x' G B{x, h) and (A. 2) imply for n large enough that 

11 2 

:^ + \e{yn\x')\ < :^ + (A^e + i^^)h + \e{yn\x)\ < ^. 

Replacing in ([7]), it follows that 



F{y'nW) 



F{yn\x') 



, X -2/7(:r) 
<1- ( ™ 
Vn 



The case y^ < j/„ is similar. 



The second lemma provides a second order asymptotic expansion of the quantile function. It is 
proved in [S]. 

Lemma 2 Suppose (A.l) hold. 

(i) Let < /3„ < a„ with a„ — )■ as n — > 00. Then, 

\\ogq{l3n\x) -log9(a„|a::)+7(a:;)log(;3„/a„)| = 0(log(a„/;3„)e(g(a„|a;)|a:;)). 

(ii) If, moreover, liminf /3„/q;„ > 0, then 

^^l^i^ = l + 0{e{q{a^\x)\x)). 



10 



The following Icmnia provides a control on the moments fij {h) for all t > 0, the case r = 1 being 
studied in [TH], Lemma 4.3. 

Lemma 3 Suppose (A. 3) holds. For all t > and x ^ E, 

< Clv.,{h) < fii^Hh) < C^^^ih). 
Proof. From (A. 3), we have 

< Cil{t e [0, 1]} < K{t) < Calji G [0, 1]} 

and thus, for all r > 0, 

< Cll{d{x,X) <h}< K^{d{x,X)/h) < C^I{d{x,X) < h}. 

Taking the expectation concludes the proof. ■ 

The following lemma provides a geometrical interpretation of the condition nipx{h)F{yn\x) — > cxd. 

Lemma 4 Suppose (A.l), (A. 2) hold and let y„ — >■ oo such that /ilogy„ —> as n —> oo . Consider 
the subset of E x W defined as Rn{x) = B{x,h) x (y„,cxD) where x £ E is such that (pxih) > 0. 
Then, P(3i G {1, . . . , n}, (X^, Yi) £ R„{x)) — > 1 as n — >■ cxd if, and only if nipx{h)F{yn\x) — >■ cxd. 

Proof. Since {Xi, Yi), i = 1, . . . ,n are independent and identically distributed random variable, 

F{3i G {1, . . . , n}, (X,, Y,) G Rn{x)) = 1 - (1 - P((X, Y) G i?„(x)))" (8) 

where 

FiiX,Y) £ R,,{x))) = E{l{X£B{x,h)nY>y„}) 
= E{l{X£B{x,h)}F{yr,\X)) 
- F{yn\x)^xih) + F(y„|x)E ("("ll^ - l) I{X £ B{x,h)} 



In view of Lemma [TJi) , we have 

F{yn\X) 



E 
and therefore 



F{yn\x) 



-1 



1{X £ B{x,h)} < 2{kc + K-y + Ke)'fx{h)h\ogyn 



mX,Y) £ Rnix)) = Fiyn\x)ipxih)il + 0{hlogyn)). 
Clearly, this probability converges to as n — > oo and thus ([S]) can be rewritten as 

P(3z G {1, . . . , n}, (X,, Y,) £ i?„(.T)) = 1 - exp (-n(^,(/i)^(2/„|x)(l + o(l))) 

which converges to 1 if and only if ntpx{h)F{yn\x) -^ oo. 
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Let us remark that the kernel estimator ([TJ can be rewritten as Fn{y\x) = 7/'„(y,x)/g„(x) with 

n 

1 " 

9n[x) = y2K{d{x,X,)/h). 

Lemma [5] and Lemma |6] are respectively dedicated to the asymptotic properties of gn(x) and 

i'n{y,x). 

Lemma 5 Suppose (A. 3) holds and let x G E such that ipx{h) > 0. We have: 

(i) E(g„(x)) = l. 

(ii) //, moreover, ipx{h) — > as h ^ then 

< lim inf n(p^(/i) var(g„(x)) < limsupn(p2;(/i) var(5„(x)) < cxd. 

Therefore, under (A. 3), if ipx{h) — > and rnpx{h) — >■ cxd then gn{x) converges to 1 in probability. 
Proof, (i) is straightforward. 
(ii) Standard calculations yields 

nipx{h)va.T{gn{x)) = (px{h) — j^. 1 



and Lemma [3] entails 



{A'{h)Y 



The condition <^x{h) -> concludes the proof. ■ 

Lemma 6 Suppose (A.l) - (A. 4) hold. Let x Cz E such that ipxih) > and introduce ijn.j ~ 
o-jUni^ + 0(1)) for J = 1, . . . , J with < oi < 02 < • ■ • < a,/ and where J is a positive integer. If 
yn — > 00 such that /ilogj/„ — S- 0, A/y„ — >■ and nLpx{h)F{y„\x) — ?> cxd as n ^ cxd, then 

(i) E(^„(y„j,a;)) = F(y„,,»(l + 0(/ilog2/„ V X/yn)), for j ^ 1, . . . , J. 

(ii) The random vector 

is asymptotically Gaussian, centered, with covariance matrix C{x) where Cjj' (x) = a,y^3/ 
for{j,f)e{l,...,J}\ 
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Proof, (i) The {Xi,Yi), i — \, . . . ,n being identically distributed, we have 

1 



!^('0n(yn,j,a;)) 



¥.{K{d{x,X)/h)Q{{Y - y„,,)/A)} 



-j^K{K{d{x,X)/hMQ{{Y~yn,,)/\)\X)} 



Taking account of (A. 4), it follows that 

E{Q{{Y - yr,,,)/X)\X) = F(j/„,,|X) + f Q'{u){F{yr,,, + \u\X) ~ F{y„^j\X))du 
and thus the bias can be expanded as 

Hi'n{yn,j,x)) - F(2/„j|x) =: Ti_„ +T2_„, 
where we have defined 



Tl,n — 
T2,n = 



-^—E{K{d{x,X)/h){F{yr.,,\X) ~ i^(y„,,|x))}, 



1 



/ii''(/i) 



EiK 



W..X)//.)Ffo„,|X)/^q-(.0( ^'g,;+^;^' -l) 



- 1 Mm 



Focusing on Ti_„ and taking account of (A. 3), it follows that 



Tl^n 



-^E{K{d{x,X)/h)iF{y,,^,\X) - F{y^^,\x))I{dix,X) < h}) 

Mi [IT') 



^(^".^■^^^1.-^/. VA/M^^(y"jl^) 



/li'V) 



E{K{d{x,X)/h) 



Piyn,j\x) 



-l]l{d{x,X) <h} 



Lemma [iji) implies that 

I{d{x,X) < h} < 2{Kc + K^ + Ke)h\ogyn,j < 3(Kc + k^ + Ke)h\ogym 
eventually and therefore 



F{yn,j\X) ^ 
F{yn.]\x) 



(9) 



|Ti,„|=F(y„,,»0(/ilogy„). 
Let us now consider T2.n. From Lemma[TJii), for all u e [—1, 1], we eventually have 



(10) 



f (y„,,- + \u\X) ^ 

F{ynjx) 



I{d{x,X) <h} < 



1 



Xu 



2/-i(x) 



- 1 



<c. 



X 



li^)' 



since A/i/„ — > as n -^ oo and where C^(a;) is a positive constant. As a consequence, 
\T2,^\ < C,^,) — ^^E{K{d{x,X)/h)F{y^^,\X)) 

= C^ix) {F{yn,j\x) +Ti^n) = F{ynj\x)0{X/y,i) 

yn,j 



(11) 
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in view of ^TU\\ . Collecting ^, (|10p and ([TT]) concludes the first part of the proof. 
(ii) Let /? 7^ in M.'' and consider the random variable 

""^^^A ^n{x)F{y^M j~-^^^'"' 

where, for all i = 1, . . . , n, the random variable Zi_„ is defined by 

Clearly, {^i,„, i = 1, . . . , n} is a set of centered, independent and identically distributed random 
variables. Let us determine an asymptotic expansion of their variance: 

var(Z.,„) - ^ -.„„,x^....... .w.^Q(m-.«.)/A) 



-/3*i?(x)/3 



/3*S(x)/3, (12) 



-p^ly^k) at, 



where B{x) is the J x J covariance matrix with coefficients defined for {j,j') G {1, . . . , J}^ by 



^j,3' W 



P{yn,j\x)F{ynj>\x)' 

A,^,,ix) = cov{K{d{x,X)/h)Qi{Y -y„,,)/X), K{d{x,X)/h)Q{{Y ~ yn^j,)/X)} 
= E {K^d{x,X)/h)Q{{Y - y„,,)/A)g((y - y„y)/A)} 
- ¥.{K{d{x,X)/h)Q{{Y-y,,,,)/X))¥.{K{d{x,X)/h)Q{{Y~yr,^j,)/\)} 

Let us first focus on T^y. 

T3,„ = E{A-2(ci(x,X)//i)E(Q((r - y„.j)/A)Q((y - y„,,v)/A)|X)} (13) 

and remark that 

nQ{{y - J/„,,)/A)Q((r - y'r.^,)/X)\X) ==: r!(y„,„ y„,,0 + r!(y„,,.,y„,,) 
where we have defined 

^{y,z) = \ [ Q'{(t-y)/\)Q((t-z)/\)F{t\X)dt 

1 

Q'{u)Q{u + {y- z)/\)F{y + u\\X)du. 
-1 
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Let us consider the case j < j' . We thus have aj < aj' and consequently {yn.j — J/nj')/-^ ^^ ^°° ^^ 
ri — > oo. Therefore, for n large enough u+(jjn.j ^Vn.j')/^ < —1 and Q{u + {yn.j -^Unj')/^) = 0. It 
follows that, eventually 51(y„j, yn.j') = 0. Similarly, for n large enough Q{u+ (yn.j' — yn.j)/^) = 1 
and 

^(2/nj',2/«j) = / Q'{u)F{yn^j' +u\\X)du. 



J -1 
For symmetry reasons, it follows that, for all j ^ j' , 

nQiiY -yn.j)/X)Q{(Y -y'„^^)/X)\X) = J Q'{u)F{y^^,^,,+u\\X)du = nQHX ~Vn^j^r)/^)\X). 

and replacing in (|13p yields 

r3,„ = E{A'2(d(a;,X)//i)E(g((y - y„,,vy)/A)|X)} = E{A'2(d(a:,X)//i)Q((r - 2/„jvj')/A)}. 

Now, since iC^ is a kernel also satisfying assumption (A. 3), part (i) of the proof implies 

T3,„ = ^l^^\h)F{y^^Jyy\x){l + 0(/ilogy„ V A/y„)), (14) 

for all j ^ j' . In the case where j = j', by definition, 

n^n=nK\d{x,X)lh)¥.{Q''{{Y-yn.j)l\)\X)] 

where K^ is a kernel also satisfying assumption (A. 3) and where the pdf associated to Q^ satisfies 
assumption (A. 4). Consequently, p^ also holds for j = j' . Second, part (i) of the proof implies 

r4,„ = {^x'i\h)fF{yn^^\x)F{y„.y\x){l + 0{h\ogynyX/yn))■ 

As a consequence, 

Ajjv(.t) = /ii2)(/i)F(y„_j-vy|a;)(l + C'(/ilogy„VA/y„)) 

- {^x'i\h)fF{y,,^,\x)F{yn^y\x){l + 0(/ilogy„ V A/y„)) 

leading to 

Bo.A^) = g.,^' ^^^1 , (1 + 0{h\ogyn V A/2/„) - (/^^ (^))V (y„^,^,-,|x)(l + 0(/ilogy„ V A/y„)) ] 
-r (.ynjAj'FJ \ ^x (h) ) 

In view of Lemma [31 {[v^ \K))'^ j [Vc (h) is bounded and taking account of F{yn.jAj'\x) -^> as 
n -^ oo yields 

Now, from the regular variation property ([3|), it is easily seen that 

FiVn.j^rl^) = a-,Y7("^F(y„|x)(l + o(l)) 
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entailing Bj,ji{x) ~ Cj,j'{x)iix ih)/F{yn\x){l + o(l)). Replacing in (|12p . it follows that 

var(Z,,„) = ^ — ^^^(1 + o(l)), 
n 

for all z =: 1, . . . , n. As a preliminary conclusion, var(^„) -^ l3*C{x)f3 as n — > oo. Conse- 
quently, Lyapounov criteria for the asymptotic normality of sums of triangular arrays reduces 
to X]"=i 11^ l^i-nT = '^IE|Zi„|" — >■ as ?i — > cxD. Next, remark that Zin is a bounded random 
variable: 



nA„(x)/ii '(/i)F(y„,j|a;) 



= 2C,ay-(^)^^5:i/3,|A„(x)(l + o(l)) 
Mi; (/j) j=i 
,/ 
< 2(C2/Ci)2ay^(^'^ |/3,|A„(x-)(l +0(1)); 

in view of Lemma [3] and thus, 

J 

nE|Zi,„|' < 2(C2/Ci)2ay^(")^|/3,|A„(a;)nvar(Zi,„)(l + o(l)) 

J 

= 2(C2/Ci)2ay^'")^ |/3,|/3*C(a;)/3A„(.T)(l + o(l)) ^ 

as n — >■ oo in view of Lemma [3] As a conclusion, ^„ converges in distribution to a centered 
Gaussian random variable with variance /3*C(a;)/3 for all /3 7^ in M"'. The result is proved. ■ 



5.2 Proofs of main results 

Proof of Theorem [H Keeping in mind the notations of LemmalHl the following expansion holds 

K\^)tp. f|fc#-0 - ^^-+.^;--^^^ (15) 

~{ \F{yn,j\x) J gn{x) 



where 



Ai,„ = A-i(x)^/3, 



j=i 



i>n{yn,j,x) -E{-4;„{yn,j,x)) 
' ' F{yn.j\x) 



^^^" ^ A^^(-)L/^.l^ ^(^-^5;^ ^ 

A3,n = 1^/3, A-i(a;)(g„(a:)-l). 
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Let us highlight that assumptions nh'^ipx(h)\og (y„)_F(j/„|a;) — ;■ and nipx{h')F{yn\x) -^ cxd imply 
that hlogyn — >■ as n — > cxd. Thus, from LcmmalBlJii), the random term Ai_„ can be rewritten as 



Ai,„ = \//3*C(x)/3C„, (16) 

where ^„ converges to a standard Gaussian random variable. The nonrandom term A2,ti is con- 
trolled with Lemma ini^i): 

A2,„ = 0{A-\x)ih\ogy„ V A/y„)) = o(l). (17) 

Finally, A3 „ can be bounded by Lemma [5] and Lemma |31 

A3,„ = Op(A-i(x)(n^,(/i))"i/2) = OpiFiy^\x)y/^ = op(l). (18) 

Collecting dUll-ldH]), it follows that 

gn{x)A-\x) X: ft ( ^"/^"-^ '"^ - l") = V^^Ci^U + op{l). 

j^l \ ^ \Vn,j\X) j 

p 

Finally, gn{x) — > 1 concludes the proof. ■ 

Proof of Theorem [2l Introduce for j = 1 , . . . , J, 

O'ri.jix) = q{an,j\x)an{x), 
Vnjix) = a~]7(a;)cr,;^(a;), 

Wn,j{x) = Vnj{x) (F„{q{an^j\x) + <7n^j{x)Zj\x) - F{q{an,j\x) + <7nj{x)Zj\x) 

an,j{x) = Vn,j{x) (a„j - F{q{an,j\x) + an,j{x)zj\x)) , 
and Zj G M. Let us study the asymptotic behavior of J-variate function defined by 

$„(zi,...,zj)=P fl {a-](a;)(g„(a„,,|x)-g(a„,j|x)) <z,}\ =P fj {VK„j(a;) < a„,,(x)} 

We first focus on the nonrandom term a„j(a;). Under (A.l), F{.\x) is differentiable. Thus, for all 
j e {1, . . . , J} there exists 9nj G (0, 1) such that 

F{q{anj\x)\x) - F{q{a,ij\x) +ct„j(x)zj|x) = -CT„j(x)zjF'(q„j |x), (19) 

where qnj = q(a„.j|x) + 6„,jcrnj{x)zj. It is clear that q{an.j\x) — ?► 00 and anj{x)/q{an,j\x) — )■ 
as n — > 00. As a consequence, g„j — )■ 00 and thus (A.l) entails 

lim '"-^f^^'i"'"^ = -iMx). (20) 
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Moreover, sinec (7„.j ~ g(a„j |a;)(l + o(l)) and F{.\x) is regularly varying at infinity, it follows that 
F{qn,j\x) = F{q{an,j\x)\x){l +o(l)) = anj(l + o(l)). In view of (HH) and ([20]), we end up with 



7(x)g(a„j|a;) 



-7(2:) 



Let us now turn to the random term Wnj{x). Defining aj = t- , j/„ j = q{anj\x) + cr„j(a;)zj 

for j = 1,..., J and 2/„ = q(a„|x), we have y„j = q{a„j\x){l + o(l)) = aj2/„(l + o(l)) since 
(7(.|x) is regularly varying at with index ~j(x). Using the same argument, it is easily shown that 
logj/n = —lix) log(Q;„)(l + 0(1)). As a consequence, Theorem [T] applies and the random vector 



converges to a centered Gaussian random variable with covariance matrix C{x). Taking account 
of (|2ip . we obtain that $„(zi, . . . , zj) converges to the cumulative distribution function of a cen- 
tered Gaussian distribution with covariance matrix j'^(x)C{x) evaluated at (zi, . . . , zj), which is 
the desired result. H 

Proof of Theorem [3l The proof is based on the following expansion: 

^"'^"^^ (l0g(C(/3n|:c)) - log{q(j3^\x))) = , y^";] A QnS + Qn.2 + Qn,^) 



l0g(a„//?„)^ ov-nv-'H n ov.v^^'M n, log(«„//3, 

where we have introduced 



Qns = CT„^(a;)(7„(x) -7(2;)), 

)g{an/P 



Qn.2 = -, — f — ----\og{qn{an\x)/q{an\x)), 



in,3 



log(a„/AO 



(logg(a„|x) -logg(/3„|x) + 7(2;) log(a„//3„)). 



First, Qn_i — > A/'(0, V{x)) as a straightforward consequence of the assumptions. Second, Theo- 

p 

rem [2] implies that g„(a„|a;)/g(a„|x) — > 1 and 

log(a„/p„) \ q[an\x) J iog{an/Pn) 

Consequently, Qn.2 — ^ as n — > 00. Finally, from Lemma [2ji), Qn.?, = 0(cr^^(a;)e(g(a„|a:)|a;)), 
which converges to in view of the assumptions. ■ 



Proof of Theorem |4l The following expansion holds for all j = 1, . . . , J: 

\ogUr,an\x) = logg(a„|x) +log (S^Zl^) + bg (^I2^^ . (22) 

V q{an\x) J \ q{Tjan\x) ' 
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First, Lemma [21^ii) entails that 

^^^/ qir.a \x) \ ^^(^)iog(i/ ) + o(,(^(„^^|^)|^))^ (23) 

V q[an\x) ) 

where the 0(e(q(a„|a;)|a;)) is not necessarily uniform in j = 1,..., J. Second, it follows from 
Theorem [2] that 

where (C)i,ii • ■ • :Cn,j)* converges to a centered Gaussian random vector with covariance matrix 
72(a;)I]. Replacing ^ and ([Ml) in dH]) yields 

logg„(Tja„|a;) = \o%q[a.a\x) + 7(2;) log(l/rj) +cr„(a;)^„j + 0(e(g(a„|a;)|x)), 

for all J = 1, . . . , J and therefore, in view of the shift invariance property of </>, we have 

({log (7„(r,-a„|.T)}j=i,. ..,,/) = (/)({7(a;)log(l/T,) +cr„(a;)^„j + 0(£((7(a„|x)|a;))}j=i^...,j) . 

A first order Taylor expansion yields: 

./ ^ , 

0({logq„(Tja„|a;)}j=i^...^,7) = (7(a;)w) + ^(cr„(a;)^„j + 0(e(q(a„|a;)|x))) — (7(x)u) 

j=i -^ 

+ Op ^(an{x)in,3 + 0{e{q{an\x)\x))f 
Thus, under the condition a^^ {x)e{q{an\x)\x) — > as n — >■ 00, it follows that 

J r,, 

'^n^{x){(t){{\0gqn{Tjan\x)}j = i,...j) - (f) {^i {x)v)) ^^£.n,] -q^{i{x)v) + Op{l) . 

j = l 1 



Taking into account of the scale invariance property of 0, we finally obtain 
and the conclusion follows. 



a-\xM{^)-^ix)) = J-^^^„,,.^(7(x)z;) +op(l) 
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F{y\x) 


-fix) 


c(x) 


e{y\x) 


Parcto 
Cauchy 
Frechet 
Burr 


y-e{x) 
-tan-i(l/y) + -(l-sigii(y)) 

TT 2 

l-exp(-y-^(^') 

(1 + ?;'^(^))^-^(^) 


1 

e{x) 
1 

1 

e{x) 


1 
1 

4 
1-e-i 
2-Hx) 




^(2"^y-«(-)(l + o(l)) 
X{x)T{x)y-(-Hl + o{l)) 


X{x)Tix) 



Table 1: Examples of distributions satisfying (A.l). Their parameters 0{x), t{x) and A(a;) are 
positive. 





c=5 


c= 10 


c=15 


c = 20 


s = 1 


d = dx 


13457 


854 


756 


913 




d = dz 


747 


527 


589 


618 


5 = 2 


d = dx 


1792 


531 


435 


472 




d = dz 


420 


360 


347 


309 


s = 3 


d = dx 


1225 


450 


396 


352 




d = dz 


329 


304 


261 


242 


s = 10 


d — dx 


CXD 


616 


359 


242 




d = dz 


67267 


116 


464 


840 



Table 2: Median of the La-errors A^;''' for a„ = clog(n)/n, c e {5,10,15,20} and t,- = (1/j)" 
s e {1, 2, 3, 10} for the two semi-metrics dx and dz- 
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Figure 1: Four realizations of the random function X{.). 





Figure 2: Comparison of the estimated quantile g^(/3„|x) corresponding to the median error with 
the true quantile function (continuous line). Horizontally: Z, vertically: quantiles. Two sets 
of (a„,rj) arc considered: (a„ ~ 151og(n)/n, tj = (l/j)^, dashed line) and (q;„ = 101og(n)/n, 
Tj ~ (1/j)^, dotted line). Top: semi-metric dz, bottom: semi-metric dx- 
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